Overview

Dataset statistics

Number of variables26
Number of observations65
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory13.3 KiB
Average record size in memory210.0 B

Variable types

NUM22
CAT4

Reproduction

Analysis started2020-05-15 04:40:46.556139
Analysis finished2020-05-15 04:41:46.357578
Duration59.8 seconds
Versionpandas-profiling v2.8.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml

Warnings

total_businesses is highly correlated with super_spreader and 10 other fieldsHigh correlation
super_spreader is highly correlated with total_businesses and 10 other fieldsHigh correlation
deaths is highly correlated with casesHigh correlation
cases is highly correlated with deaths and 1 other fieldsHigh correlation
population is highly correlated with super_spreader and 10 other fieldsHigh correlation
households is highly correlated with super_spreader and 10 other fieldsHigh correlation
females is highly correlated with super_spreader and 10 other fieldsHigh correlation
over65 is highly correlated with super_spreader and 10 other fieldsHigh correlation
white is highly correlated with super_spreader and 10 other fieldsHigh correlation
black is highly correlated with super_spreader and 10 other fieldsHigh correlation
asian is highly correlated with super_spreader and 10 other fieldsHigh correlation
hispanic is highly correlated with super_spreader and 10 other fieldsHigh correlation
poverty is highly correlated with super_spreader and 10 other fieldsHigh correlation
no_hsdiploma is highly correlated with super_spreader and 10 other fieldsHigh correlation
daily_cases is highly correlated with casesHigh correlation
state is highly correlated with county and 1 other fieldsHigh correlation
county is highly correlated with state and 1 other fieldsHigh correlation
county_name is highly correlated with county and 1 other fieldsHigh correlation
date is uniformly distributed Uniform
county is uniformly distributed Uniform
county_name is uniformly distributed Uniform
deaths has 22 (33.8%) zeros Zeros

Variables

countyfips
Real number (ℝ≥0)

Distinct count13
Unique (%)20.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean22468.384615384617
Minimum9001
Maximum44009
Zeros0
Zeros (%)0.0%
Memory size520.0 B

Quantile statistics

Minimum9001
5-th percentile9001
Q19007
median9013
Q344003
95-th percentile44009
Maximum44009
Range35008
Interquartile range (IQR)34996

Descriptive statistics

Standard deviation17158.69143
Coefficient of variation (CV)0.7636815784
Kurtosis-1.821198012
Mean22468.38462
Median Absolute Deviation (MAD)10
Skewness0.4856207988
Sum1460445
Variance294420691.5
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
4400957.7%
 
4400757.7%
 
4400557.7%
 
4400357.7%
 
4400157.7%
 
901557.7%
 
901357.7%
 
901157.7%
 
900957.7%
 
900757.7%
 
Other values (3)1523.1%
 
ValueCountFrequency (%) 
900157.7%
 
900357.7%
 
900557.7%
 
900757.7%
 
900957.7%
 
ValueCountFrequency (%) 
4400957.7%
 
4400757.7%
 
4400557.7%
 
4400357.7%
 
4400157.7%
 

date
Categorical

UNIFORM

Distinct count5
Unique (%)7.7%
Missing0
Missing (%)0.0%
Memory size520.0 B
12apr2020
13
05apr2020
13
29mar2020
13
26apr2020
13
19apr2020
13
ValueCountFrequency (%) 
12apr20201320.0%
 
05apr20201320.0%
 
29mar20201320.0%
 
26apr20201320.0%
 
19apr20201320.0%
 

Length

Max length9
Median length9
Mean length9
Min length9

super_spreader
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count63
Unique (%)96.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3265.523076923077
Minimum416
Maximum8812
Zeros0
Zeros (%)0.0%
Memory size520.0 B

Quantile statistics

Minimum416
5-th percentile434.4
Q11004
median1713
Q35670
95-th percentile8473
Maximum8812
Range8396
Interquartile range (IQR)4666

Descriptive statistics

Standard deviation3002.572122
Coefficient of variation (CV)0.9194766202
Kurtosis-0.9674556485
Mean3265.523077
Median Absolute Deviation (MAD)768
Skewness0.9002426499
Sum212259
Variance9015439.347
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
126723.1%
 
100423.1%
 
819011.5%
 
848711.5%
 
835111.5%
 
41611.5%
 
92611.5%
 
89811.5%
 
567011.5%
 
42711.5%
 
Other values (53)5381.5%
 
ValueCountFrequency (%) 
41611.5%
 
42511.5%
 
42711.5%
 
43211.5%
 
44411.5%
 
ValueCountFrequency (%) 
881211.5%
 
860811.5%
 
852511.5%
 
848711.5%
 
841711.5%
 

total_businesses
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count64
Unique (%)98.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3826.6769230769232
Minimum494
Maximum10217
Zeros0
Zeros (%)0.0%
Memory size520.0 B

Quantile statistics

Minimum494
5-th percentile517.2
Q11187
median2028
Q36611
95-th percentile9858.2
Maximum10217
Range9723
Interquartile range (IQR)5424

Descriptive statistics

Standard deviation3484.975238
Coefficient of variation (CV)0.9107053739
Kurtosis-0.970659841
Mean3826.676923
Median Absolute Deviation (MAD)912
Skewness0.8960089325
Sum248734
Variance12145052.41
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
120223.1%
 
51111.5%
 
659011.5%
 
121211.5%
 
287411.5%
 
192811.5%
 
901411.5%
 
218011.5%
 
977911.5%
 
300911.5%
 
Other values (54)5483.1%
 
ValueCountFrequency (%) 
49411.5%
 
51111.5%
 
51311.5%
 
51611.5%
 
52211.5%
 
ValueCountFrequency (%) 
1021711.5%
 
997311.5%
 
996511.5%
 
987811.5%
 
977911.5%
 

county
Categorical

HIGH CORRELATION
UNIFORM

Distinct count13
Unique (%)20.0%
Missing0
Missing (%)0.0%
Memory size520.0 B
Newport
 
5
Providenc
 
5
Middlesex
 
5
Washingto
 
5
Litchfiel
 
5
Other values (8)
40
ValueCountFrequency (%) 
Newport57.7%
 
Providenc57.7%
 
Middlesex57.7%
 
Washingto57.7%
 
Litchfiel57.7%
 
Tolland57.7%
 
Hartford57.7%
 
Kent57.7%
 
New Londo57.7%
 
Fairfield57.7%
 
Other values (3)1523.1%
 

Length

Max length9
Median length9
Mean length7.923076923
Min length4

state
Categorical

HIGH CORRELATION

Distinct count2
Unique (%)3.1%
Missing0
Missing (%)0.0%
Memory size520.0 B
Connecticut
40
Rhode Island
25
ValueCountFrequency (%) 
Connecticut4061.5%
 
Rhode Island2538.5%
 

Length

Max length12
Median length11
Mean length11.38461538
Min length11

cases
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count64
Unique (%)98.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1140.4923076923078
Minimum7
Maximum10529
Zeros0
Zeros (%)0.0%
Memory size520.0 B

Quantile statistics

Minimum7
5-th percentile20.2
Q174
median197
Q3751
95-th percentile5480.4
Maximum10529
Range10522
Interquartile range (IQR)677

Descriptive statistics

Standard deviation2114.847439
Coefficient of variation (CV)1.854328543
Kurtosis6.715747985
Mean1140.492308
Median Absolute Deviation (MAD)165
Skewness2.560858098
Sum74132
Variance4472579.691
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
27623.1%
 
127811.5%
 
15711.5%
 
5311.5%
 
13511.5%
 
29911.5%
 
4211.5%
 
16911.5%
 
4011.5%
 
3911.5%
 
Other values (54)5483.1%
 
ValueCountFrequency (%) 
711.5%
 
1111.5%
 
1711.5%
 
2011.5%
 
2111.5%
 
ValueCountFrequency (%) 
1052911.5%
 
743411.5%
 
671511.5%
 
553411.5%
 
526611.5%
 

deaths
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct count33
Unique (%)50.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean61.261538461538464
Minimum0
Maximum707
Zeros22
Zeros (%)33.8%
Memory size520.0 B

Quantile statistics

Minimum0
5-th percentile0
Q10
median6
Q335
95-th percentile402.2
Maximum707
Range707
Interquartile range (IQR)35

Descriptive statistics

Standard deviation140.2807761
Coefficient of variation (CV)2.289867014
Kurtosis9.809842821
Mean61.26153846
Median Absolute Deviation (MAD)6
Skewness3.116651777
Sum3982
Variance19678.69615
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
02233.8%
 
134.6%
 
234.6%
 
434.6%
 
634.6%
 
2423.1%
 
723.1%
 
2023.1%
 
1811.5%
 
2111.5%
 
Other values (23)2335.4%
 
ValueCountFrequency (%) 
02233.8%
 
134.6%
 
234.6%
 
434.6%
 
511.5%
 
ValueCountFrequency (%) 
70711.5%
 
57911.5%
 
44711.5%
 
42911.5%
 
29511.5%
 

county_name
Categorical

HIGH CORRELATION
UNIFORM

Distinct count13
Unique (%)20.0%
Missing0
Missing (%)0.0%
Memory size520.0 B
Tolland
 
5
Newport
 
5
Wndham
 
5
Washington
 
5
New London
 
5
Other values (8)
40
ValueCountFrequency (%) 
Tolland57.7%
 
Newport57.7%
 
Wndham57.7%
 
Washington57.7%
 
New London57.7%
 
Hartford57.7%
 
Middlesex57.7%
 
Kent57.7%
 
Fairfield57.7%
 
Providence57.7%
 
Other values (3)1523.1%
 

Length

Max length10
Median length9
Mean length8.153846154
Min length4

population
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count13
Unique (%)20.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean356778.07692307694
Minimum48900
Maximum944348
Zeros0
Zeros (%)0.0%
Memory size520.0 B

Quantile statistics

Minimum48900
5-th percentile48900
Q1126242
median163861
Q3634533
95-th percentile944348
Maximum944348
Range895448
Interquartile range (IQR)508291

Descriptive statistics

Standard deviation330681.4468
Coefficient of variation (CV)0.9268547262
Kurtosis-1.00054419
Mean356778.0769
Median Absolute Deviation (MAD)80786
Skewness0.8962529515
Sum23190575
Variance1.093502193e+11
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
18303157.7%
 
15126957.7%
 
94434857.7%
 
26888157.7%
 
85933957.7%
 
11653857.7%
 
16336857.7%
 
63453357.7%
 
12624257.7%
 
16386157.7%
 
Other values (3)1523.1%
 
ValueCountFrequency (%) 
4890057.7%
 
8307557.7%
 
11653857.7%
 
12624257.7%
 
15126957.7%
 
ValueCountFrequency (%) 
94434857.7%
 
89473057.7%
 
85933957.7%
 
63453357.7%
 
26888157.7%
 

households
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count13
Unique (%)20.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean136789.15384615384
Minimum19553
Maximum349064
Zeros0
Zeros (%)0.0%
Memory size520.0 B

Quantile statistics

Minimum19553
5-th percentile19553
Q149111
median68833
Q3238171
95-th percentile349064
Maximum349064
Range329511
Interquartile range (IQR)189060

Descriptive statistics

Standard deviation123537.3302
Coefficient of variation (CV)0.903122263
Kurtosis-1.016484401
Mean136789.1538
Median Absolute Deviation (MAD)33616
Skewness0.8834899791
Sum8891295
Variance1.526147195e+10
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
6883357.7%
 
1955357.7%
 
23817157.7%
 
4911157.7%
 
6689257.7%
 
5523257.7%
 
4444957.7%
 
3521757.7%
 
34049157.7%
 
10740257.7%
 
Other values (3)1523.1%
 
ValueCountFrequency (%) 
1955357.7%
 
3521757.7%
 
4444957.7%
 
4911157.7%
 
5523257.7%
 
ValueCountFrequency (%) 
34906457.7%
 
34049157.7%
 
32985757.7%
 
23817157.7%
 
10740257.7%
 

age
Real number (ℝ≥0)

Distinct count13
Unique (%)20.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean42.184615384615384
Minimum37.3
Maximum47.2
Zeros0
Zeros (%)0.0%
Memory size520.0 B

Quantile statistics

Minimum37.3
5-th percentile37.3
Q140.3
median41.4
Q344.5
95-th percentile47.2
Maximum47.2
Range9.9
Interquartile range (IQR)4.2

Descriptive statistics

Standard deviation2.980888645
Coefficient of variation (CV)0.07066293287
Kurtosis-1.028916989
Mean42.18461538
Median Absolute Deviation (MAD)2.8
Skewness-0.1078271345
Sum2742
Variance8.885697115
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
47.257.7%
 
41.357.7%
 
37.357.7%
 
43.857.7%
 
44.257.7%
 
40.357.7%
 
41.457.7%
 
45.457.7%
 
40.157.7%
 
40.457.7%
 
Other values (3)1523.1%
 
ValueCountFrequency (%) 
37.357.7%
 
37.557.7%
 
40.157.7%
 
40.357.7%
 
40.457.7%
 
ValueCountFrequency (%) 
47.257.7%
 
45.457.7%
 
4557.7%
 
44.557.7%
 
44.257.7%
 

income
Real number (ℝ≥0)

Distinct count13
Unique (%)20.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean75086.38461538461
Minimum55233
Maximum92969
Zeros0
Zeros (%)0.0%
Memory size520.0 B

Quantile statistics

Minimum55233
5-th percentile55233
Q170223
median75578
Q381301
95-th percentile92969
Maximum92969
Range37736
Interquartile range (IQR)11078

Descriptive statistics

Standard deviation9572.411208
Coefficient of variation (CV)0.1274853125
Kurtosis-0.1467948938
Mean75086.38462
Median Absolute Deviation (MAD)5723
Skewness-0.1613734387
Sum4880615
Variance91631056.33
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
7831457.7%
 
7022357.7%
 
7136857.7%
 
5523357.7%
 
7557857.7%
 
6712857.7%
 
7723757.7%
 
8491657.7%
 
9296957.7%
 
8476157.7%
 
Other values (3)1523.1%
 
ValueCountFrequency (%) 
5523357.7%
 
6477457.7%
 
6712857.7%
 
7022357.7%
 
7136857.7%
 
ValueCountFrequency (%) 
9296957.7%
 
8491657.7%
 
8476157.7%
 
8130157.7%
 
7831457.7%
 

females
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count13
Unique (%)20.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean182896.30969230767
Minimum25330.0
Maximum484385.03
Zeros0
Zeros (%)0.0%
Memory size520.0 B

Quantile statistics

Minimum25330
5-th percentile25330
Q165131.996
median84642
Q3326048
95-th percentile484385.03
Maximum484385.03
Range459055.03
Interquartile range (IQR)260916.004

Descriptive statistics

Standard deviation170561.8767
Coefficient of variation (CV)0.9325605146
Kurtosis-1.002678275
Mean182896.3097
Median Absolute Deviation (MAD)42515
Skewness0.8998129206
Sum11888260.13
Variance2.909135378e+10
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
7544857.7%
 
4212757.7%
 
9258457.7%
 
65131.99657.7%
 
46008057.7%
 
8369257.7%
 
44502857.7%
 
2533057.7%
 
484385.0357.7%
 
32604857.7%
 
Other values (3)1523.1%
 
ValueCountFrequency (%) 
2533057.7%
 
4212757.7%
 
5897857.7%
 
65131.99657.7%
 
7544857.7%
 
ValueCountFrequency (%) 
484385.0357.7%
 
46008057.7%
 
44502857.7%
 
32604857.7%
 
13417857.7%
 

over65
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count13
Unique (%)20.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean58599.230769230766
Minimum9406.0
Maximum147260.0
Zeros0
Zeros (%)0.0%
Memory size520.0 B

Quantile statistics

Minimum9406
5-th percentile9406
Q122405
median31051.998
Q393384
95-th percentile147260
Maximum147260
Range137854
Interquartile range (IQR)70979

Descriptive statistics

Standard deviation51352.42031
Coefficient of variation (CV)0.8763326692
Kurtosis-0.920291357
Mean58599.23077
Median Absolute Deviation (MAD)13701.998
Skewness0.915392738
Sum3808950
Variance2637071072
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
14726057.7%
 
14341957.7%
 
1825757.7%
 
4632057.7%
 
3645657.7%
 
1735057.7%
 
14241157.7%
 
2444257.7%
 
29628.00257.7%
 
31051.99857.7%
 
Other values (3)1523.1%
 
ValueCountFrequency (%) 
940657.7%
 
1735057.7%
 
1825757.7%
 
2240557.7%
 
2444257.7%
 
ValueCountFrequency (%) 
14726057.7%
 
14341957.7%
 
14241157.7%
 
9338457.7%
 
4632057.7%
 

white
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count13
Unique (%)20.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean245109.46384615384
Minimum45111.0
Maximum588974.0
Zeros0
Zeros (%)0.0%
Memory size520.0 B

Quantile statistics

Minimum45111
5-th percentile45111
Q1115008
median145885
Q3390443.03
95-th percentile588974
Maximum588974
Range543863
Interquartile range (IQR)275435.03

Descriptive statistics

Standard deviation193156.4609
Coefficient of variation (CV)0.7880416281
Kurtosis-0.9625958839
Mean245109.4638
Median Absolute Deviation (MAD)58300
Skewness0.8631687787
Sum15932115.15
Variance3.730941841e+10
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
14588557.7%
 
7128057.7%
 
4511157.7%
 
20418557.7%
 
12860357.7%
 
55342557.7%
 
16289857.7%
 
11500857.7%
 
390443.0357.7%
 
9679557.7%
 
Other values (3)1523.1%
 
ValueCountFrequency (%) 
4511157.7%
 
7128057.7%
 
9679557.7%
 
11500857.7%
 
12860357.7%
 
ValueCountFrequency (%) 
58897457.7%
 
55342557.7%
 
54612057.7%
 
390443.0357.7%
 
20418557.7%
 

black
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count13
Unique (%)20.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean31601.384615384617
Minimum621
Maximum114565
Zeros0
Zeros (%)0.0%
Memory size520.0 B

Quantile statistics

Minimum621
5-th percentile621
Q12513
median4299
Q351578
95-th percentile114565
Maximum114565
Range113944
Interquartile range (IQR)49065

Descriptive statistics

Standard deviation43575.70133
Coefficient of variation (CV)1.37891747
Kurtosis-0.6377126164
Mean31601.38462
Median Absolute Deviation (MAD)3678
Skewness1.085617897
Sum2054090
Variance1898841746
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
5157857.7%
 
805057.7%
 
62157.7%
 
164057.7%
 
264957.7%
 
10632557.7%
 
9941257.7%
 
251357.7%
 
429957.7%
 
286357.7%
 
Other values (3)1523.1%
 
ValueCountFrequency (%) 
62157.7%
 
164057.7%
 
208457.7%
 
251357.7%
 
264957.7%
 
ValueCountFrequency (%) 
11456557.7%
 
10632557.7%
 
9941257.7%
 
5157857.7%
 
1421957.7%
 

asian
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count13
Unique (%)20.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14810.307533846155
Minimum995.99994
Maximum49375.0
Zeros0
Zeros (%)0.0%
Memory size520.0 B

Quantile statistics

Minimum995.99994
5-th percentile995.99994
Q12406
median4956
Q325915.998
95-th percentile49375
Maximum49375
Range48379.00006
Interquartile range (IQR)23509.998

Descriptive statistics

Standard deviation17195.54166
Coefficient of variation (CV)1.161052303
Kurtosis-0.4714970472
Mean14810.30753
Median Absolute Deviation (MAD)3404
Skewness1.070372859
Sum962669.9897
Variance295686653.1
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
4937557.7%
 
1097257.7%
 
4632757.7%
 
701357.7%
 
995.9999457.7%
 
495657.7%
 
25915.99857.7%
 
336157.7%
 
3385057.7%
 
155257.7%
 
Other values (3)1523.1%
 
ValueCountFrequency (%) 
995.9999457.7%
 
155257.7%
 
163257.7%
 
240657.7%
 
336157.7%
 
ValueCountFrequency (%) 
4937557.7%
 
4632757.7%
 
3385057.7%
 
25915.99857.7%
 
1097257.7%
 

hispanic
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count13
Unique (%)20.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean55434.53850000001
Minimum1421.0
Maximum182653.0
Zeros0
Zeros (%)0.0%
Memory size520.0 B

Quantile statistics

Minimum1421
5-th percentile1421
Q17624
median10941
Q3141206
95-th percentile182653
Maximum182653
Range181232
Interquartile range (IQR)133582

Descriptive statistics

Standard deviation69870.08204
Coefficient of variation (CV)1.26040703
Kurtosis-1.148170618
Mean55434.5385
Median Absolute Deviation (MAD)7027
Skewness0.8804839867
Sum3603245.003
Variance4881828365
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
18265357.7%
 
15778557.7%
 
14120657.7%
 
469357.7%
 
391457.7%
 
1094157.7%
 
8086.000557.7%
 
983557.7%
 
2776257.7%
 
15136157.7%
 
Other values (3)1523.1%
 
ValueCountFrequency (%) 
142157.7%
 
391457.7%
 
469357.7%
 
762457.7%
 
8086.000557.7%
 
ValueCountFrequency (%) 
18265357.7%
 
15778557.7%
 
15136157.7%
 
14120657.7%
 
2776257.7%
 

amerindian
Real number (ℝ≥0)

Distinct count13
Unique (%)20.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean694.4615384615385
Minimum43
Maximum2211
Zeros0
Zeros (%)0.0%
Memory size520.0 B

Quantile statistics

Minimum43
5-th percentile43
Q1118
median305
Q31257
95-th percentile2211
Maximum2211
Range2168
Interquartile range (IQR)1139

Descriptive statistics

Standard deviation664.311511
Coefficient of variation (CV)0.9565850291
Kurtosis-0.2911551456
Mean694.4615385
Median Absolute Deviation (MAD)262
Skewness0.8360519138
Sum45140
Variance441309.7837
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
139857.7%
 
125757.7%
 
21557.7%
 
6657.7%
 
5957.7%
 
82357.7%
 
11857.7%
 
94657.7%
 
30557.7%
 
4357.7%
 
Other values (3)1523.1%
 
ValueCountFrequency (%) 
4357.7%
 
5957.7%
 
6657.7%
 
11857.7%
 
21557.7%
 
ValueCountFrequency (%) 
221157.7%
 
139857.7%
 
129457.7%
 
125757.7%
 
94657.7%
 

other
Real number (ℝ≥0)

Distinct count13
Unique (%)20.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1423.9230784615386
Minimum26.0
Maximum6221.0
Zeros0
Zeros (%)0.0%
Memory size520.0 B

Quantile statistics

Minimum26
5-th percentile26
Q1178.99998
median320
Q32240
95-th percentile6221
Maximum6221
Range6195
Interquartile range (IQR)2061.00002

Descriptive statistics

Standard deviation2061.247894
Coefficient of variation (CV)1.447583739
Kurtosis0.6875558939
Mean1423.923078
Median Absolute Deviation (MAD)180
Skewness1.475244467
Sum92555.0001
Variance4248742.882
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
183.9999857.7%
 
622157.7%
 
380.0000357.7%
 
298.0000357.7%
 
267357.7%
 
178.9999857.7%
 
9557.7%
 
32057.7%
 
224057.7%
 
14057.7%
 
Other values (3)1523.1%
 
ValueCountFrequency (%) 
2657.7%
 
9557.7%
 
14057.7%
 
178.9999857.7%
 
183.9999857.7%
 
ValueCountFrequency (%) 
622157.7%
 
541657.7%
 
267357.7%
 
224057.7%
 
380.0000357.7%
 

poverty
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count13
Unique (%)20.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean37038.76916923077
Minimum3432.0002
Maximum98855.0
Zeros0
Zeros (%)0.0%
Memory size520.0 B

Quantile statistics

Minimum3432.0002
5-th percentile3432.0002
Q110908.999
median12650
Q381751
95-th percentile98855
Maximum98855
Range95422.9998
Interquartile range (IQR)70842.001

Descriptive statistics

Standard deviation38845.68304
Coefficient of variation (CV)1.048784393
Kurtosis-1.234514403
Mean37038.76917
Median Absolute Deviation (MAD)5859
Skewness0.8360452144
Sum2407519.996
Variance1508987091
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
2614057.7%
 
3432.000257.7%
 
10908.99957.7%
 
679157.7%
 
939057.7%
 
1265057.7%
 
1124957.7%
 
8175157.7%
 
9866157.7%
 
9706357.7%
 
Other values (3)1523.1%
 
ValueCountFrequency (%) 
3432.000257.7%
 
679157.7%
 
939057.7%
 
10908.99957.7%
 
1124957.7%
 
ValueCountFrequency (%) 
9885557.7%
 
9866157.7%
 
9706357.7%
 
8175157.7%
 
2614057.7%
 

no_hsdiploma
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count13
Unique (%)20.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean24909.23083076923
Minimum3382.9998
Maximum65533.0
Zeros0
Zeros (%)0.0%
Memory size520.0 B

Quantile statistics

Minimum3382.9998
5-th percentile3382.9998
Q15215.0005
median9986
Q359205.004
95-th percentile65533
Maximum65533
Range62150.0002
Interquartile range (IQR)53990.0035

Descriptive statistics

Standard deviation26277.4944
Coefficient of variation (CV)1.054929981
Kurtosis-1.296375328
Mean24909.23083
Median Absolute Deviation (MAD)5264
Skewness0.8202652565
Sum1619100.004
Variance690506711.7
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
59205.00457.7%
 
3382.999857.7%
 
386357.7%
 
5215.000557.7%
 
6553357.7%
 
65169.99657.7%
 
919257.7%
 
1015757.7%
 
6918.000557.7%
 
505857.7%
 
Other values (3)1523.1%
 
ValueCountFrequency (%) 
3382.999857.7%
 
386357.7%
 
505857.7%
 
5215.000557.7%
 
6918.000557.7%
 
ValueCountFrequency (%) 
6553357.7%
 
65169.99657.7%
 
6489057.7%
 
59205.00457.7%
 
1525057.7%
 

t
Real number (ℝ≥0)

Distinct count5
Unique (%)7.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.0
Minimum1
Maximum5
Zeros0
Zeros (%)0.0%
Memory size520.0 B

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q34
95-th percentile5
Maximum5
Range4
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.425219281
Coefficient of variation (CV)0.4750730938
Kurtosis-1.307526882
Mean3
Median Absolute Deviation (MAD)1
Skewness0
Sum195
Variance2.03125
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
51320.0%
 
41320.0%
 
31320.0%
 
21320.0%
 
11320.0%
 
ValueCountFrequency (%) 
11320.0%
 
21320.0%
 
31320.0%
 
41320.0%
 
51320.0%
 
ValueCountFrequency (%) 
51320.0%
 
41320.0%
 
31320.0%
 
21320.0%
 
11320.0%
 

super_spreader_density
Real number (ℝ≥0)

Distinct count64
Unique (%)98.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean84.63312284615384
Minimum82.246742
Maximum86.709587
Zeros0
Zeros (%)0.0%
Memory size520.0 B

Quantile statistics

Minimum82.246742
5-th percentile82.6045848
Q183.656273
median84.895836
Q385.766144
95-th percentile86.2991962
Maximum86.709587
Range4.462845
Interquartile range (IQR)2.109871

Descriptive statistics

Standard deviation1.234920638
Coefficient of variation (CV)0.01459145778
Kurtosis-1.085808813
Mean84.63312285
Median Absolute Deviation (MAD)0.982574
Skewness-0.2615071237
Sum5501.152985
Variance1.525028983
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
84.21052623.1%
 
86.00614911.5%
 
85.82153311.5%
 
83.81245411.5%
 
85.81314811.5%
 
85.05747211.5%
 
85.8784111.5%
 
84.94078111.5%
 
83.31780211.5%
 
85.76614411.5%
 
Other values (54)5483.1%
 
ValueCountFrequency (%) 
82.24674211.5%
 
82.35815411.5%
 
82.48031611.5%
 
82.59351311.5%
 
82.64887211.5%
 
ValueCountFrequency (%) 
86.70958711.5%
 
86.51454211.5%
 
86.38233911.5%
 
86.30289511.5%
 
86.28440111.5%
 

daily_cases
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count60
Unique (%)92.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean478.10769230769233
Minimum7
Maximum3095
Zeros0
Zeros (%)0.0%
Memory size520.0 B

Quantile statistics

Minimum7
5-th percentile14.6
Q134
median110
Q3390
95-th percentile1964.2
Maximum3095
Range3088
Interquartile range (IQR)356

Descriptive statistics

Standard deviation750.0101067
Coefficient of variation (CV)1.568705375
Kurtosis2.070444702
Mean478.1076923
Median Absolute Deviation (MAD)82
Skewness1.763949764
Sum31077
Variance562515.1601
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
2523.1%
 
3423.1%
 
1423.1%
 
3223.1%
 
7223.1%
 
2611.5%
 
17211.5%
 
4011.5%
 
3811.5%
 
3711.5%
 
Other values (50)5076.9%
 
ValueCountFrequency (%) 
711.5%
 
1111.5%
 
1423.1%
 
1711.5%
 
2011.5%
 
ValueCountFrequency (%) 
309511.5%
 
248411.5%
 
201411.5%
 
197411.5%
 
192511.5%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

Sample

First rows

countyfipsdatesuper_spreadertotal_businessescountystatecasesdeathscounty_namepopulationhouseholdsageincomefemalesover65whiteblackasianhispanicamerindianotherpovertyno_hsdiplomatsuper_spreader_densitydaily_cases
04400129mar2020425511BristolRhode Island110Bristol489001955344.27557825330.009406.045111.0621995.999941421.04326.03432.00023382.9998183.17025811
14400105apr2020432513BristolRhode Island250Bristol489001955344.27557825330.009406.045111.0621995.999941421.04326.03432.00023382.9998284.21052614
24400112apr2020416494BristolRhode Island390Bristol489001955344.27557825330.009406.045111.0621995.999941421.04326.03432.00023382.9998384.21052614
34400119apr2020427516BristolRhode Island650Bristol489001955344.27557825330.009406.045111.0621995.999941421.04326.03432.00023382.9998482.75193826
44400126apr2020444522BristolRhode Island1110Bristol489001955344.27557825330.009406.045111.0621995.999941421.04326.03432.00023382.9998585.05747246
5900129mar202085259878FairfieldConnecticut124521Fairfield94434834049140.392969484385.03143419.0588974.09941249375.00000182653.012575416.081751.000064890.0000186.3028951245
6900105apr202084179729FairfieldConnecticut305096Fairfield94434834049140.392969484385.03143419.0588974.09941249375.00000182653.012575416.081751.000064890.0000286.5145421805
7900112apr202083519631FairfieldConnecticut5534248Fairfield94434834049140.392969484385.03143419.0588974.09941249375.00000182653.012575416.081751.000064890.0000386.7095872484
8900119apr202086089965FairfieldConnecticut7434447Fairfield94434834049140.392969484385.03143419.0588974.09941249375.00000182653.012575416.081751.000064890.0000486.3823391900
9900126apr2020881210217FairfieldConnecticut10529707Fairfield94434834049140.392969484385.03143419.0588974.09941249375.00000182653.012575416.081751.000064890.0000586.2484133095

Last rows

countyfipsdatesuper_spreadertotal_businessescountystatecasesdeathscounty_namepopulationhouseholdsageincomefemalesover65whiteblackasianhispanicamerindianotherpovertyno_hsdiplomatsuper_spreader_densitydaily_cases
554400929mar202012511485WashingtoRhode Island210Washington1262424911144.58130165131.99624442.0115008.016402406.03914.0946320.011249.05058.0184.24242421
564400905apr202012501481WashingtoRhode Island530Washington1262424911144.58130165131.99624442.0115008.016402406.03914.0946320.011249.05058.0284.40242832
574400912apr202012121445WashingtoRhode Island1160Washington1262424911144.58130165131.99624442.0115008.016402406.03914.0946320.011249.05058.0383.87543563
584400919apr202012671509WashingtoRhode Island1880Washington1262424911144.58130165131.99624442.0115008.016402406.03914.0946320.011249.05058.0483.96289172
594400926apr202012671505WashingtoRhode Island2926Washington1262424911144.58130165131.99624442.0115008.016402406.03914.0946320.011249.05058.0584.186043104
60901529mar20209061089WindhamConnecticut70Wndham1165384444941.36477458978.00018257.096795.020841552.013368.06695.011691.09192.0183.1955957
61901505apr20208981085WindhamConnecticut321Wndham1165384444941.36477458978.00018257.096795.020841552.013368.06695.011691.09192.0282.76497725
62901512apr20208941073WindhamConnecticut661Wndham1165384444941.36477458978.00018257.096795.020841552.013368.06695.011691.09192.0383.31780234
63901519apr20209261117WindhamConnecticut1002Wndham1165384444941.36477458978.00018257.096795.020841552.013368.06695.011691.09192.0482.90062734
64901526apr20209291128WindhamConnecticut1574Wndham1165384444941.36477458978.00018257.096795.020841552.013368.06695.011691.09192.0582.35815457